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(57) Abstract 



An isolated, substantially homogenous hK2 polypeptide is provided as well as isolated nucleic acid molecules encoding hK2 
polypeptide, including (a) a cDNA molecule comprising the nucleotide sequence of the coding region of human hK2 gene; (b) a DNA 
molecule capable of hybridizing under stringent conditions to a molecule of (a); and (c) a genetic variant of any of the DNA molecules of 
(a) and (b) which encodes of polypeptide processing an antigenic function of naturally occurring polypeptide. 
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RECOMBINANT HK2 POLYPEPTIDE 

5 Background of foe Jfovefltipn 

The glandular kallikreins are a subgroup of serine proteases 
which are involved in the post-translational processing of specific polypeptide 
precursors to their biologically active forms. The human kallikrein gene 
family consists of three members: prostate-specific antigen, human glandular 

10 kallikrein, and pancreatic/renal kallikrein. See J. A. Clements, Endocr. Rev.. 
If), 393 (1989) and TM Chu et al. (U.S. Patent No. 4,446,122). A common 
nomenclature for these members of the tissue (glandular) kallikrein gene 
families was recently adopted by T. Berg et al., in Recent Progress on Kinins: 
geochemistry and Molecular Biology of the Kallikrei n-Kinin System. Agents 

15 and Actions Supplements. Vol. I. T H. Fritz et al.., eds., Birkkauser Verlag, 
Basel (1992), and is defined in Table I, below. 
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The DNA sequence homology between hKLK2 and hKLK3 
(exon regions) is 80%, whereas the homology between hKLK2 and hKLKl is 
65% The deduced amino acid sequence homology of hK2 to hKl is 57% 
Amino acid sequences deduced by LJ. Schedlich et al., DNA. 6, 429 (1987) 
5 and B J. Morris, Clin. Exp. Pharmacol. Physiol 16, 345 (1989) indicate that 
hK2 may be a trypsin-like serine protease, whereas hK3 (PSA) is a 
chymotrypsin-like serine protease. Therefore, if hK2 is indeed secretory, it 
may have a different physiological function than hK3. 

The hKLK2 gene is located about 12 kbp downstream from the 

10 hKLK3 gene in a head-to-tail fashion on chromosome 19. (P.H. Riegman et 
al., FEBS Lett., 247, 123, (1989)). The similarities of gene structure and 
deduced amino acid sequences of these human kallikreins suggest that then- 
evolution may involve the same ancestral gene. Most interestingly, as 
reported by Morris, cited supm; P. Chapdelaine, FEBS Lett. 236 : 205 (1988); 

15 and Young, Biochemistry. 31. 1952 (1992), both hK2 and hK3 may be 
expressed only in the human prostate, while expression of hKl is limited to 
the pancreas, submandibular gland, kidney, and other nonprostate tissues. 

Tremendous interest has been generated in hK3 (PSA) because 
of the important role it plays as a marker to detect and to monitor progression 

20 of prostate carcinoma Its usefulness as a marker is based on the elevated 
serum concentration of circulating hK3 proteins which are frequently 
associated with prostatic cancer. The serum concentration of hK3 has been 
found to be proportional to the cancer mass in untreated patients, but is also 
proportional to the volume of hyperplastic tissue in patients with benign 

25 prostatic hyperplasia (BPH). The serum levels of hK3 become reduced 
following prostate cancer therapy. 

Despite the information which can be ascertained about hK2 
from the genomic DNA sequence, very little is known about the hK2 
polypeptide itself. The reason for this is that the protein has not been purified 

30 and characterized Thus, a need exists for a method to obtain hK2 

polypeptide and related polypeptides in sufficient quantity and purity for 
characterization and for use as therapeutic/diagnostic agents or reagents. 
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Summary p f the Invention 
The present invention provides an isolated, substantially 
homogenous hK2 polypeptide. As used herein, in the term "hK2 polypeptide" 
includes pre-pro hK2, pro hK2 and mature hK2 polypeptides. Pre-pro hK2 is 
5 secreted by the cell in vivo, and is cleaved during secretion to yield pro hK2, 
which is then enzymatically cleaved in the extracellular environment to yield 
"mature" hK2. Most preferably, the hK2 polypeptide is contiguous in amino 
acid sequence with SEQ ID NO: 16, SEQ IDNO: 6 or SEQ ID NO: 10. 

The present invention also provides isolated nucleic acid 
10 molecules encoding hK2 polypeptide, including (a) a cDNA molecule 

comprising the nucleotide sequence of the coding region of the hK2 gene; (b) 
a DNA molecule capable of hybridizing under stringent conditions to a 
nucleotide sequence complementary to the nucleotide sequence of (a); and (c) 
a genetic variant of any of the DNA molecules of (a) and (b) which encodes 
15 of polypeptide processing an antigenic function of naturally occurring hK2 
polypeptide. Preferably, the nucleic acid comprises a discrete, isolated DNA 
or RNA molecule encoding the complete hK2 polypeptide, which can include 
the pre-pro, pro or mature forms. Most preferably, the nucleic acid is a DNA 
sequence contiguous with SEQ ED NO: 5, 7 or 9, i.e., as shown in Figs. 5, 6 
20 or 7. These DNA sequences can be produced using the polymerase chain 
reaction (PCR), and novel oligonucleotide primers employed in the synthesis 
are also an embodiment of the invention. 

The nucleic acid sequence also can comprise a promoter 
operably linked to the nucleic acid sequence. Therefore, the invention also 
25 comprises a chimeric expression vector comprising the above-described 

nucleic acid sequence, operationally linked to control sequences recognized by 
a host cell transformed with the vector, as well as said transformed host cell, 
and methods of its preparation and use to produce recombinant hK2. Thus, 
the present invention also provides a method of using a nucleic acid molecule, 
30 such as a cDNA clone encoding hK2 polypeptide, comprising expressing the 
nucleic acid molecule in a cultured host cell transformed, preferably stably 
transformed, with a chimeric expression vector comprising said nucleic acid 
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molecule operably linked to control sequences recognized by the host cell 
transformed with the vector, and recovering the hK2 polypeptide from the 
transgenic host cell, i.e., from the culture medium. As used herein, the term 
"chimeric" means that the vector comprises DNA from at least two different 
5 species, or coirpises DNA from the same species, which is linked or 
associated in a manner which does not occur in the "native" form of said 
species. 

More specifically, E. coli and baculovirus insect cells systems 
have been employed to produce hK2 polypeptides in two forms, i.e. pre-pro 

10 hK2 (pphK2) and mature hK2 (mhK2). Thus, the present invention provides 
the first example of the overexpression of hK2 in heterologous systems. 
However, although pphK2 produced in E. coli has proven to be an invaluable 
resource for generating antibodies to the denatured form of the protein, it is 
desirable to both discern the steps involved in the biosynthesis of hK2 and to 

15 obtain antibodies specific for the fully processed and secreted form of the 
protein. Therefore, mammalian cell systems have been employed to produce 
hK2 polypeptides. Thus, the present invention also provides the first example 
of the expression of hK2 in mammalian cells and purification and 
characterization of the secreted protein. 

20 The high degree of amino acid sequence homology of hK2 with 

hK3 indicates that measuring serum concentrations of both proteins may be 
useful in the diagnosis and monitoring of prostate cancer. For exanple, the 
antibodies developed against hK3 now used in these assays could theoretically 
also recognize hK2, because of mutual contamination in the antigenic 

25 preparations used to develop the anti-hK3 antibodies or because of antibody 
cross-reactivity between these two proteins. This could account for the 
substantial percentage of false positive results which are observed in current 
hK3 assays. On the other hand, if circulating hK2 levels are also elevated 
above baseline levels in prostate cancer patients, detection of hK2 by hK2- 

30 specific antibodies could provide an alternative, confirmatory assay for 
prostate cancer. 
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Therefore, hK2 polypeptide, as well as variants and subunits 
thereof, produced by the present method can be used to produce populations 
of antibodies that, in turn, can be used as the basis for assays to detect and 
quantify hK2 polypeptide (or "protein") in samples derived from tissues such 

5 as prostate carcinomas, cells such as prostate cell lines, or from fluids such as 
seminal fluid or blood. Thus, the present invention also provides populations 
of monoclonal or polyclonal antibodies that specifically bind to hK2 
polypeptide, while not significantly binding to hK3. The term "significantly" 
is defined by reference to the comparative assays discussed below. These 

10 antibodies can also be used in affinity chromatography, to purify mammalian 
hK2 from natural sources. The isolated, substantially homogeneous hK2 can 
also be employed as a component in diagnostic assays for "native" hK2 in 
samples derived from human tissues or physiological fluids. For example, the 
recombinant hK2 can be bound to a detectable label and employed in 

15 competitive immunoassays for hK2, as described in U.S. patent application 
Serial No. 08/096,946, filed July 22, 1993. 

As used herein with respect to the present invention, the terms 
"hK2 polypeptide," "hK2 protein," and "hK2" are considered to refer to 
identical human materials, unless otherwise indicated 

20 

Brief Description of t he Figures 
Figure 1 depicts a time course study of recombinant pphK2 in 
s/9 cells infected with recombinant pphK2 virus. At each of the time points 
cells were depleted of methionine and cysteine for 1 hour in deficient media 
25 and then supplemented with [ 35 S]-methione and [ 35 S]-cysteine. Protein was 
determined by Bradford assay. Aliquots of protein (20 |ag) were loaded onto 
a 12% Tris-Glycine SDS gel. A Phosphorimager cassette was exposed 
overnight. The band of interest is indicated with an arrow, w.t: wild type. 

Figure 2 depicts the detection of recombinant mhK2 in cell 
30 lysate fractions. S/9 cells were infected either with recombinant mhK2, wild 
type or left uninfected for 48 hours. Methionine and cysteine pools were 
depleted for 1 hour in deficient media. Cells were supplemented with [ 35 S]- 
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methionine and [ 35 S]-cysteine for 6 hours. Cells were separated into soluble 
and insoluble fractions using H 2 0 and repeated freeze/thaw conditions. 
Aliquots of protein (50 ug per lane) were loaded onto a 10% Tris-Glycine 
SDS gel and electrophoresed The gel was dried and exposed to x-ray film 
5 for 2 days. The band of interest is indicated with an arrow. 

Figure 3 depicts the expression of recombinant pphK2 in E 
coli. E. coli strain BL21 (DE3) LysS harboring pBppHK2 was grown in LB 
media to O.D.^o 0.2 and incubated without (lane 2, not-induced (N)) or with 
(lane 3, induced (I)) 0.4 mM IPTG for 2 hrs. Cells were lysed in sample 
10 buffer and subjected to SDS/PAGE on a 4-20% gradient gel. Protein bands 
were visualized by staining the gel with Coomassie blue. 

Figure 4 depicts the amino acid sequences of mature hK2 
(deduced from cDNA sequence, SEQ ID NO: 16) and hK3 (SEQ ID NO: 1). 
Underlined sequences denote nonhomologous regions that can be used for 
15 preparation of antibodies specific to hK2. 

Figure 5 depicts pphK2 cDNA containing a BamHl site at the 
5' end and a Pstl site at the 3' end (SEQ ID NO: 5) (coding strand is 
numbered) as well as the amino acid sequence of pre-pro hK2 encoded 
thereby (SEQ ID NO: 6). The amino acid sequences of pro hK2 and mature 
20 hK2 are also shown on the Figure. 

Figure 6 depicts mhK2 cDNA containing an EcoRl site at the 
5' end and Pstl site at the 3' end (SEQ ID NO: 7), as well as the 
corresponding amino acid sequence (SEQ ID NO: 8) which encompasses the 
amino acid sequence of mhK2 polypeptide. 
25 Figure 7 depicts pro hK2 DNA (SEQ ID NO: 9) (coding strand 

is numbered) and the amino acid sequence of pro hK2 (SEQ ID NO: 10). 

Figure 8 depicts a gel confirming the expression of recombinant 
pphK2 in a mammalian cell line. AV12-pGThK2 (Lane 4-6) and AV12-pGT- 
d (Lane 3) clonal cell lines were grown in D10F media About 300ul of 
30 spent medium from the above clones were concentrated and subjected to 
SDS/PAGE along with See Blue MW marker (lane 1) and pphK2 lysate from 
E. coli cells (lane 2). The gel was blotted onto nitrocellulose paper and 
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immunoblotted using a 1/1000 dilution of anti-pphK2 rabbit antiserum. HRP- 
goat anti-rabbit was used as the secondary probe and the blot was developed 
by DAB plus H 2 0 2 . Lane 3 (AV12-pGT-d) is AV12 transfected with vector 
without insert. 

5 Figure 9 depicts the DEAE chromatography of AVI 2 media 

The sample was applied in a bicaibonte buffer, pH 8 and eluted with a salt 
gradient. The solid line is the A^o elution profile. The triangle line 
represents the ELISA assay of individual samples which had been dried onto 
microtiter plates and developed with rabbit anti-hK2 antibody. 

10 Figure 10 depicts the hydrophobic interaction profile of DEAE 

fractions. The fractions were pooled, concentrated and applied to an HIC 
column in 1.2 M sodium sulfate, and eluted with a decreasing salt gradient. 
The solid line is A^o and the triangle line shows the ELISA assay profile of 
the fractions using rabbit anti-hK2 antibody. 

15 Figure 1 1 depicts the Size Exclusion Chromatography of HIC 

purified prohK2, in particular, the A m profile of 22 min peak eluted off HIC 
column. The 19.4 min peak appears homogeneous by SDS-PAGE. After this 
peak was lyophilized, the N-terminal sequence and amino acid composition 
confirmed its identity as the pro form of hK2. 

20 Figure 12 depicts the SDS/PAGE analysis of prohK2 and PSA. 

l.5\ig of purified phK2 or PSA was boiled in sample buffer containing (R) or 
not containing (N) 1% BME. Samples were subjected to SDS/PAGE on a 4- 
20% gel. The protein bands were visualized by staining the gel with silver. 

25 Detailed Description of the Invention 

As used herein, the term "hK2 polypeptide" preferably 
encompasses the recombinant pre-pro, pro and mature hK2 polypeptides. As 
proposed herein, a mature hK2 polypeptide having the amino acid sequence 
shown in Fig. 4 (SEQ ID NO: 16), as well as "variant" polypeptides which 

30 share at least 90% homology with SEQ ID NO: 16 in the regions which are 
substantially homologous with hK3, i.e., which regions are not identified by 
bars as shown in Fig. 4. Such hK2 polypeptides also possess antigenic 
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function in common with the mature hK2 molecule of Fig. 4, in that said 
polypeptides are also definable by antibodies which bind specifically thereto, 
but which do not cross-react with hK3 (or hKl). Preferably, said antibodies 
react with antigenic sites or epitopes that are also present on the mature hK2 
5 molecule of Fig. 4. Antibodies useful to define common antigenic function 
are described in detail in Sen No. 08/096,946, i.e., polyclonal antisera 
prepared in vivo against hK2 submit 41-56. 

"Isolated hK2 nucleic acid" is RNA or DNA containing greater 
than 15, preferably 20 or more, sequential nucleotide bases that encode a 

10 biologically active hK2 polypeptide or a variant fragment thereof, that is 
complementary to the non-coding strand of the native hK2 polypeptide RNA 
or DNA, or hybridizes to said RNA or DNA and remains stably bound under 
stringent conditions. Thus, the RNA or DNA is isolated in that it is free from 
at least one contaminating nucleic acid with which it is normally associated in 

15 the natural source and is preferably substantially free of any other mammalian 
RNA or DNA. The phrase "free from at least one contaminating source 
nucleic acid with which it is normally associated" includes the case where the 
nucleic acid is reintroduced into the source or natural cell but is in a different 
chromosomal location or is otherwise flanked by nucleic acid sequences not 

20 normally found in the source cell. An example of isolated hK2 nucleic acid is 
RNA or DNA that encodes a biologically active hK2 polypeptide sharing at 
least 90% sequence identity with the hK3-homologous regions of the hK2 
peptide of Fig. 4, as described above. The term "isolated, substantially 
homogenous" as used with respect to an hK2 polypeptide is defined in terms 

25 of the methodologies discussed herein below. 

As used herein, the term "recombinant nucleic acid," i.e., 
"recombinant DNA" refers to a nucleic acid, i.e., to DNA that has been 
derived or isolated from any appropriate tissue source, that may be 
subsequently chemically altered in vitro, an later introduced into target host 

30 cells, such as cells derived from animal, plant, insect, yeast, fungal or 

bacterial sources. An example of recombinant DNA "derived" from a source, 
would be a DNA sequence that is identified as a useful fragment encoding 
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hK2, or a fragment or variant thereof, and which is then chemically 
synthesized in essentially pure form. An example of such DMA "isolated" 
from a source would be a useful DNA sequence that is excised or removed 
from said source by chemical means, e.g. by the use of restriction 
5 endonucleases, so that it can be further manipulate e.g., amplified, for use in 
the invention, by the methodology of genetic engineering. 

Therefore, "recombinant DNA" includes completely synthetic 
DNA sequences, semi-synthetic DNA sequences, semi-synthetic DNA 
sequences, DNA sequences isolated from biological sources, and DNA 
10 sequences derived from introduced RNA, as well as mixtures thereof. 
Generally, the recombinant DNA sequence is not originally resident in the 
genome of the host target cell which is the recipient of the DNA, or it is 
resident in the genome but is not expressed. 

The recombinant DNA sequence, used for transformation 
15 herein, may be circular or linear, double-stranded or single-stranded. 
Generally, the DNA sequence is in the form of chimeric DNA, such as 
plasmid DNA, that can also contain coding regions flanked by control 
sequences which promote the expression of the recombinant DNA present in 
the resultant cell line. For example, the recombinant DNA may itself 
comprise a promoter that is active in mammalian cells, or may utilize a 
promoter already present in the genome that is the transformation target 
Such promoters include the CMV promoter, as well as the SV 40 late 
promoter and retroviral LTRs (long terminal repeat elements). Aside from 
recombinant DNA sequences that serve as transcription units for hK2 or 
25 portions thereof, a portion of the recombinant DNA may be imtranscribed, 
serving a regulatory or a structural function. 

"Control sequences" is defined to mean DNA sequences 
necessary for the expression of an operably linked coding sequence in a 
particular host organism. The control sequences that are suitable for 
30 prokaryotic cells, for example, include a promoter, and optionally an operator 
sequence, and a ribosome binding site. Eukaryotic cells are known to utilize 
promoters, polyadenylation signals, and enhancers. 



20 
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"Operably linked 11 is defined to mean that the nucleic acids are 
placed in a functional relationship with another nucleic acid sequence. For 
example, DNA for a presequence or secretory leader is operably linked to 
DNA for a polypeptide if it is expressed as a preprotein that participates in 
5 the secretion of the polypeptide; a promoter or enhancer is operably linked to 
a coding sequence if it affects the transcription of the sequence; or a ribosome 
binding site is operably linked to a coding sequence if it is positioned so as to 
facilitate translation. Generally, "operably linked" means that the DNA 
sequences being linked are contiguous and, in the case of a secretory leader, 
10 contiguous and in reading phase. However, enhancers do not have to be 
contiguous. Linking is accomplished by ligation at convenient restriction 
sites. If such sites do not exist, the synthetic oligonucleotide adaptors or 
linkers are used in accord with conventional practice. 

Aside from recombinant DNA sequences that serve as 
1 5 transcription units for hK2 or portions thereof, a portion of the recombinant 
DNA may be untranscribed, serving a regulatory or a structural function. 

The recombinant DNA to be introduced into the cells further 
will generally contain either a selectable marker gene or a reporter gene or 
both to facilitate identification and selection of transformed cells from the 
20 population of cells sougjit to be transformed. Alternatively, the selectable 
marker may be carried on a separate piece of DNA and used in a co- 
transformation procedure. Both selectable markers and reporter genes may be 
flanked with impropriate regulatory sequences to enable expression in the host 
cells. Useful selectable markers are well known in the art and include, for 
25 example, antibiotic and herbicide-resistance genes, such asnea hpt dhfr. bar. 
aroA. dapA and the like. 

Reporter genes are used for identifying potentially transformed 
cells and for evaluating the functionality of regulatory sequences. Reporter 
genes which encode for easily assayable proteins are well known in the art. 
30 In general, a reporter gene is a gene which is not present in or expressed by 
the recipient organism or tissue and which encodes a protein whose expression 
is manifested by some easily detectable property, e.g., enzymatic activity. 
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Preferred genes include the chloramphenicol acetyl transferase gene (cat) from 
Tn9 of E. colL the beta-glucuronidase gene (gus) of the uidA locus of E. coli . 
and the luciferase gene from firefly Photinus pyralis . Expression of the 
reporter gene is assayed at a suitable time after the DNA has been introduced 

5 into the recipient cells. 

Other elements functional in the host cells, such as introns, 
enhancers, polyadenylation sequences and the like, may also be a part of the 
recombinant DNA- Such elements may or may not be necessary for the 
function of the DNA, but may provide improved expression of the DNA by 

10 affecting transcription, stability of the mKNA, or the like. Such elements may 
be included in the DNA as desired to obtain the optimal performance of the 
transforming DNA in the cell. 

The general methods for constructing recombinant DNA which 
can transform target cells are well known to those skilled in the art, and the 

15 same compositions and methods of construction may be utilized to produce 
the DNA useful herein. For example, J. Sambrook et ah, Molecular Cloning: 
A Laboratory Manual, Cold Spring Harbor Laboratory Press (2d ed., 1989), 
provides suitable methods of construction. 

The recombinant DNA can be readily introduced into the target 

20 cells by transfection with an expression vector comprising cDNA encoding 
hK2, for example, by the modified calcium phosphate precipitation procedure 
of C. Chen et al., M61. Cell. Biol., £ 2745 (1987). Transfection can also be 
accomplished by lipofectin, using commercially available kits, e.g., provided 
by BRL. 

25 Suitable host cells for the expression of hK2 polypeptide are 

derived from multicellular organisms. Such host cells are capable of complex 
processing and glycosylation activities. In principle, any higher eukaryotic 
cell culture is workable, whether from vertebrate or invertebrate culture. 
Examples of invertebrate cells include plant and insect cells. Numerous 

30 baculoviral strains and variants and corresponding permissive insect host cells 
from hosts such as Spodoptera frugipenta (caterpillar), Aedes aegypti 
(mosquito), Aedes albopictus (mosquito), DrvsopMa melanogaster (fruitfly), 
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and Bombyx mori have been identified See, e.g., Luckow et al., 
3io/Technplpgy, 6_; 47 (1988); Miller et al., in Genetic Engineering J. K. 
Setlow et al., eds., Vol. 8 (Plenum Publishing, 1986), pp. 277-279; and Maeda 
et al., Nature, 3J5j592 (1985). A variety of viral strains for transfection are 
5 publicly available, e.g., the L-l variant of Autogrupha cdifomica NPV and the 
Bm-5 strain of Bombyx mori NPV, and such viruses may be used, preferably 
for transfection of Spodoptera frugipenia cells. 

Recovery or isolation of a given fragment of DNA from a 
restriction digest can empoly separation of the digest on polyacrylamide or 

10 agarose gel by electrophoresis, identification of the fragment of interest by 
comparison of its mobility versus that of marker DNA fragments of known 
molecular weight, removal of the gel section containing the desired fragment, 
and separation of the gel from DNA For example, see Lawn et al., Nucleic 
Acjds_Rgs J , % 6103-6114 (1981), and Goeddel et al.. Nucleic Acids Res.. 8, 

15 4057(1980). 

"Southern analysis" or "Southern blotting" is a method by 
which the presence of DNA sequences in a restriction endonuclease digest of 
DNA or DNA-containing composition is confirmed by hybridization to a 
known, labeled oligonucleotide or DNA fragment Southern analysis typically 

20 involves electrophoretic separation of DNA digests on agarose gels, 

denaturation of the DNA after electrophoretic separation, and transfer of the 
DNA to nitrocellulose, nylon, or another suitable membrane support for 
analysis with a radiolabeled, biotinylated, or enzyme-labeled probe as 
described in sections 9.37-9.52 of Sambrook et al., supra 

25 "Northern analysis" or "Northern blotting" is a method used to 

identify RNA sequences that hybridize to a known probe such as an 
oligonucleotide, DNA fragment, cDNA or fragment thereof, or RNA fragment 
The probe is labeled with a radioisotope such as 32-P, by biotinylation or with 
an enzyme. The RNA to be analyzed can be usually electrophoretically 

30 separated on an agarose or polyacrylamide gel, transferred to nitrocellulose, 
nylon, or other suitable membrane, and hybridized with the probe, using 
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standard techniques well known in the art such as those described in sections 
7.39-7.52 of Samhrook et al., supra 

"Polymerase chain reaction" or "PCR" refers to a procedure or 
technique in which amounts of a preselected piece of nucleic acid, RNA 

5 and/or DNA, are amplified as described in U.S. Patent No. 4,683,195. 
Generally, sequence information from the ends of the region of interest or 
beyond is employed to design oligonucleotide primers. These primers will be 
identical or similar in sequence to opposite strands of the template to be 
amplified. PCR can be used to amplify specific RNA sequences, specific 

10 DNA sequences from total genomic DNA, and cDNA transcribed from total 
cellular RNA, bacteriophage or plasmid sequences, and the like. See 
generally Mullis et al., Cold Spring Harbor Symp. Quant. Biol, 51, 263 
(1987); Erlich, ed., PCR Technology, (Stockton Press, NY, 1989). 

"Stringent conditions" are those that (1) employ low ionic 

15 strength and high temperature for washing, for example, 0.015 M NaCl/0.0015 
M sodium citrate (SSC); 0.1% sodium lauryl sulfite (SDS) at 50°C, or (2) 
employ during hybridization a denaturing agent such as formamide, for 
example, 50% (vol/vol) formamide with 0.1% bovine serum albumin/0.1% 
Ficoll/0.1% polyvinylpyrrolidone/50 mM sodium phosphate buffer at pH 6.5 

20 with 750 mM NaCl, 75 mM sodium citrate at 42°C. Another example is use 
of 50% formamide, 5 x SSC (0.75 M NaCl, 0.075 M sodium citrate), 50 mM 
sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5 x Denhardt's 
solution, sonicated salmon sperm DNA (50 jigfail), 0.1% SDS, and 10% 
dextran sulfate at 42°Q with washes at 42°C in 0.2 x SSC and 0.1% SDS. 

25 When hK2 polypeptide is expressed in a recombinant cell other 

than one of human origin, the hK2 polypeptide is completely free of proteins 
or polypeptides of human origin. However, it is necessary to purify hK2 
polypeptide from recombinant cell proteins or polypeptides to obtain 
preparations that are substantially homogeneous as to hK2 polypeptide. For 

30 example, the culture medium or lysate can be centrifuged to remove 

particulate cell debris. The membrane and soluble protein fractions are then 
separated. The hK2 polypeptide may then be purified from the soluble 
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protein fraction and, if necessary, from the membrane fraction of the culture 
lysate. HK2 polypeptide can then be purified from corrtarninant soluble 
proteins and polypeptides by fractionation on immunoafiHnity or ion-exchange 
columns; ethanol precipitation; reverse phase HPLC; chromatography on silica 
5 or on a cation-exchange resin such as DEAE; chromatofocusing SDS-PAGE; 
ammonium sulfate precipitation; gel filtration using, for example, Sephadex 
G-75; or ligand affinity chromatography. 

Once isolated from the resulting transgenic host cells, 
derivatives and variants of the hK2 polypeptide can be readily prepared For 
10 example, amides of the hK2 polypeptides of the present invention may also be 
prepared by techniques well known in the art for converting a carboxylic acid 
group or precursor, to an amide. A preferred method for amide formation at 
the C-terminal carboxyl group is to cleave the polypeptide from a solid 
support with an appropriate amine, or to cleave in the presence of an alcohol, 
1 5 yielding an ester, followed by aminolysis with the desired amine. 

Salts of carboxyl groups of the hK2 polypeptide may be 
prepared in the usual manner by contacting the peptide with one or more 
equivalents of a desired base such as, for example, a metallic hydroxide base, 
e.g., sodium hydroxide; a metal carbonate or bicarbonate base such as, for 
20 example, sodium carbonate or sodium bicarbonate; or an amine base such as, 
for example, triethylamine, triethanolarnine, and the like. 

N-acyl derivatives of an amino group of the present 
polypeptides may be prepared by utilizing an N-acyl protected amino acid for 
the final condensation, or by acylating a protected or unprotected peptide. O- 
25 acyl derivatives may be prepared, for example, by acylation of a free hydroxy 
peptide or peptide resin. Either acylation may be carried out using standard 
acylating reagents such as acyl halides, anhydrides, acyl imidazoles, and the 
like. Both N- and O-acylation may be carried out together, if desired In 
addition, the internal hK2 amino acid sequence of Fig. 4 can be modified by 
30 substituting one or two conservative amino acid substitutions for the positions 
specified, including substitutions which utilize the D rather than L form. The 
invention is also directed to variant or modified forms of the hK2 polypeptide 
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of Fig. 4. One or more of the residues of this polypeptide can be altered, so 
long as antigenic function is retained Conservative amino acid substitutions 
are preferred-that is, for example, aspartic-glutamic as acidic amino acids; 
lysine/arginine/histidine as basic amino acids; leucine/isoleucine, 

5 methionine/valine as hydrophobic amino acids; 

serine/gjycine/alanine/threonine as hydrophilic amino acids. 

Acid addition salts of the polypeptides may be prepared by 
contacting the polypeptide with one or more equivalents of the desired 
inorganic or organic acid, such as, for example, hydrochloric acid Esters of 

10 carboxyl groups of the polypeptides may also be prepared by any of the usual 
methods known in the art. 

Once isolated, hK2 polypeptide and its antigenically active 
variants, derivatives and fragments thereof can be used in assays for hK2 in 
samples derived from biological materials suspected of containing hK2 or 

15 anti-hK2 antibodies, as disclosed in detail in Serial No. 08/096,946. For 
example, the hK2 polypeptide can be labelled with a detectable label, such as 
via one or more radiolabeled peptidyl residues, and can be used to compete 
with endogenous hK2 for binding to anti-hK2 antibodies, i.e., as a "capture 
antigen" to bind to anti-hK2 antibodies in a sample of a physiological fluid, 

20 via various competitive immunoassay format for hK2 which uses immobilized 
anti-hK2 antibodies is carried out by: 

(a) providing an amount of anti-hK2 antibodies attached to a solid 
surface; 

(b) mixing the sample of physiological fluid to be tested with a 
25 known amount of hK2 polypeptide which comprises a 

detectable label, to produce a mixed sample; 

(c) contacting said antibodies on said solid surface with said mixed 
sample for a sufficient time to allow immunological reactions 
to occur between said antibodies and said hK2, and between 

30 said antibodies and said labelled polypeptide; 

(d) separating the solid surface from the mixed sample; 
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(e) detecting or determining the presence or amount of labelled 
polypeptide either bound to the antibodies on the solid surface 
or remaining in the mixed sample; and 

(f) determining from the result in step (e) the presence or amount 
5 of said hK2 in said sample. 

In another format which can detect endogenous hK2 in a 
sample by a competitive inhibition immunoassay, a known amount of anti- 
hK2 antibody is added to a sample containing an unknown amount of 
endogenous hK2. The known amount is selected to be less than the amount 

10 required to complex all of the hK2 suspected to be present, e.g., that would be 
present in a sample of the same amount of physiological fluid obtained from a 
patient known to be prostate cancer. Next, a known amount of the hK2 
polypeptide of the invention or a subunit thereof, comprising a detectable 
label is added. If endogenous hK2 is present in the sample, fewer antibodies 

15 will be available to bind the labelled hK2 polypeptide, and it will remain free 
in solution. If no endogenous hK2 is present, the added labelled polypeptide 
will complex with the added anti-hK2 antibodies to form binary complexes. 
Next, the binary antibody-antigen complexes are precipitated by an anti- 
mammal IgG antibody (sheep, goat, mouse, etc.). The amount of radioactivity 

20 or other label in the precipitate (a ternary complex) is inversely proportional 
to the amount of endogenous hK2 that is present in the sample, e.g^ a pellet 
containing reduced amounts of radioactivity is indicative of the presence of 
endogenous hK2. 

Alternatively to the conventional techniques for preparing 

25 polyclonal antibodies or antisera in laboratory and farm animals, monoclonal 
antibodies against hK2 polypeptide can be prepared using known hybridoma 
cell culture techniques. In general, this method involves prepared an 
antibody-producing fused cell line, e.g., of primary spleen cells fused with a 
compatible continuous line of myeloma cells, and growing the fused cells 

30 either in mass culture or in an animal species from which the myeloma cell 
line used was derived or is compatible. Such antibodies offer many 
advantages in comparison to those produced by inoculation of animals, as they 
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are highly specific and sensitive and relatively "pure" immunochemically. 
Immunologically active fragments of the present antibodies are also within the 
scope of the present invention, e.g., the f(ab) fragment, as are partially 
humanized monoclonal antibodies. 
5 The invention will be further described by reference to the 

following detailed examples. 

Example \ f 

Consfauction of hK2 expression vectois 

10 (A) Generation of recombinant baculoviruses containing pphK2 and mhK2 
coding sequences 

A cDNA (approximately 820 bp long) encoding the entire 
prepro-hK2 (pphK2) (from nucleotide #40 to #858 relative to the start site of 
the pphK2 transcript), as shown in Fig. 5, was synthesized from RNA of 

15 human BPH tissue using reverse-transcription polymerase chain reaction (RT- 
PCR) technology with a pair of hK2 specific oligonucleotide primers 
(5ACGCGGATCCAGCATCTGGGACCTGGTTCTCI3 1 SEQ ID NO: 2 and 
5 ACAGCTGCAGTTTACTAGAGGTAGGGGTGGGAC 3* SEQ ID NO:3). 
This cDNA was generated such that 5' and 3' ends (with respect to pphK2 

20 sense sequence) were bracketed with BamHl and Pst 1 sequences 

respectively. The cDNA was then purified by agarose gel electrophoresis, and 
digested with BamHl and Pst 1 restriction enzymes. The restricted cDNA 
was ligated with the BamHl -Pst 1 digested pVL1393 plasmid vector and 
transformed into the E. coli HB101 strain. E. coli harboring pphK2 

25 cDNA/pVL1393 plasmid vector were selected and verified by restriction 

enzyme mapping and DNA sequencing. Plasmid pphK2 cDNA/pVL1393 was 
mass-produced in E. coli and purified by CsCl gradient ultra-centrifugation. 

cDNA encoding the mature hK2 was synthesized using PCR 
with the aforementioned pphK2 cDNA as the template plus a pair of hK2 

30 oligonucleotides (SACGCGGATCCAGCATGTGGGACCTGGTTCrcn' 
SEQ ID NO: 2 and 

5ACCGGAATTCATGATTGTGGGAGGCTGGGAGTGT3' SEQ ID NO: 4). 
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As noted, the 3' end oligonucleotide was the same 3' end oligonucleotide used 
for synthesizing the pphK2 cDNA. However, the 5' end oligonucleotide was 
different from the 5' oligonucleotide used for the pphK2 cDNA, and therefore 
generates a cDNA coding for the mature form of hK2 (mhK2), as shown in 
5 Fig. 6. The mhK2 cDNA was bracketed with EcoRl and Pstl sequences at 
the 5' and 3' ends respectively. The protein produced from the mhK2 cDNA 
will gain an exogenous methionine at its N-terminus. The mhK2/pVL1393 
vector was generated and purified as described for pphK2/pVL1393. The 
DNA sequence analysis for pphK2 and mhK2 in pVL1393 showed that one 

10 nucleotide (#805) has been altered (G to T) in a silent mutation. 

pphK2/pVL1393 or mhK2/pVL1393 DNA (2 fig) were 
cotransfected with a linearized Baculogold DNA (0.5 ug; Pharmingen, San 
Diego, CA) into S/9 insect cells according to Pharmingen instructions (S. 
Gruenwold et al., baculovirus expression vector system: Procedures and 

15 Methods Maniral, Pharmingen, San Diego, CA (1993)). Four to six days after 
the transfection, S/9 cell spent medium containing viral particles was harvested 
and used to infect fresh S[9 cells to amplify viral titers. Total RNA was 
isolated for Northern blot analysis of authentic pphK2 or mhK2 transcript 
using hK2 cDNA as a probe. Further proof of pphK2 or mhK2 transcript 

20 expressed in recombinant virus infected S/9 cells was obtained by RT-PCR 
and DNA sequencing. Pure recombinant baculovirus containing pphK2 or 
mhK2 were obtained by secondary or tertiary plaque purification protocol 
according to instructions from Pharmingen (S. Gruenwold et al., cited above). 

25 Example 2. 

Generation of prokaiyotic expression vector 

A 0.8 kb fragment representing the entire preprohK2 (pphK2) 
coding sequence was generated by polymerase chain reaction (PCR) using 
primers A (5TATACATATGTGGGACCTC^GT1UIU^CC3 , SEQ ID NO.: 11) 

30 and B (5 , ATATGGATCCIX^GGGGTTGGCTGCGATGGT3 , SEQ ID NO: 
12) and plasmid pVL1393 containing pphK2 as the template. The pphK2 
bacterial expression vector (pBPPHK2) was prepared by standard DNA 
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cloning technology, (Samhrook, cited above), to subclone this 0.8 kb fragment 
into the Ndel/BamHl site of the plasmid pPHS579 (a gift from Dr. H. 
Hsiung, Eli Lilly Co, Inc.) under the control of T7 promoter. The DNA of 
the entire insert plus the cloning sites was sequenced to confirm that no 
5 cloning artifacts had occurred and to ensure that no anomalies in the sequence 
had been generated by PCR. pBPPHK2 was transformed into E. coli BL21 
(DE3)Lys S (Novagen, Inc., Madison, WI). 

Example 3. 

10 Generation of a mammalian expression vector 

To express hK2 in mammalian cell lines, a 0.8 kb fragment 
representing the entire preprohk2 (pphK2) coding sequence was generated by 
PCR using primers 
A^ATATGGATCCATA^ 

15 (SEQ ID NO: 17) and 

BCS'ATATGGATCCTCAGGGGTTGGCTGCGATGGBO (SEQ ID NO: 12) 
and plasmid pVL1393 containing pphK2 as the template. The mammalian 
egression vector (pGlhK2) was prepared using standard DNA cloning 
technology (Sambrook, 1989), to clone this 0.8 kb fragment into the Bel 1 site 

20 of the plasmid pGT-d (a gift from Dr. Brian Grinnell, Eli Lilly, Inc.) under 
control of the GBMT promoter. The DNA of the entire insert plus the 
cloning sites was sequenced to confirm that no cloning artifacts had occurred 
and to ensure that no anomalies in the sequence had been generated by PCR. 
AV12-664 (ATCC CRL-9595), a cell line derived from a adenovirus-induced 

25 tumors in Syrian hamster, was grown in Dulbecco's modified Eagle's medium 
supplemented with 10% fetal bovine serum (D10F) and transfected with 
plasmid pGThK2 using the calcium phosphate method 

Example 4. 

30 Identification of recombinant pphK2 and mhK2 

A haculovirus - inse ct cell system 
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S/9 cells (7xlOVplate) were seeded onto 100 mm Coming 
plates with 10% fetal calf serum - Graces medium at room temperature for 1 
hr. After attachment on culture plates, cells were infected with wild type or 
recombinant baculovirus in serum free Excell-400 medium and incubated at 
5 27°C Control cells were grown in the absence of virus. At designated times 
(24-96 hr) cells were placed in fresh S/9-IIOO media deficient of either 
methionine or methionine and cysteine for 45-60 min at 27°C, then incubated 
with Promix (0.143 mCi/plate; a mixture of [ 35 S>methionine and [ 35 S]- 
cysteine; 1,000-1,400 Ci/mmol; Amersham) in serum free and 

10 methionine/cysteine deficient ^91100 medium (Biofluids) for 5-8 hr or 20 hr. 
After the end of each incubation time, cells and spent media were separated 
by centrifagation (1,000 rpm; Beckman J-6B; Beckman, Fullerton, CA). Cells 
were washed and centrifuged (13,000 rpm; Biofuge 13, Baxter) twice. The 
washed cells were lysed by freeze/thaw in a detergent buffer (10 mM Tris, pH 

15 7.5; 130 mM NaCl, 1% Triton X-100, 10 mM NaF; 10 mM NaPi, lOmM 
Nappi, pH7.5) or H 2 0 and centrifuged to obtain cytosol and insoluble cellular 
fractions. Protein contents of the above samples were determined by either 
the Bradford or Lowry method (BioRad, Inc., Melville, N.Y.). The above 
spent media, cystosol and insoluble cellular fraction were frozen and stored 

20 separately until used. A duplicate set of samples were prepared without 35 S- 
labeling. 

For SDS-polyaoylamide gel electrophoresis (PAGE) analysis of 
expression of hK2 protein in ^9 cells, samples were added to sample buffer 
(UK. Laemmli, Nature, 22Z, 680 (1970)), heated at 95°C for 5 minutes and 

25 subjected to SDS-PAGE under reducing conditions. 

Northern blot analysis was routinely used to screen and isolate 
clonal recombinant baculoviruses expressing pphK2 or mhK2 mRNA. A 
comparison of the corresponding lanes in both autoradiographs of the 
Northern blot and photographs of ethidium bromide staining of RNA shows 

30 that mRNA for pphK2 or mhK2 was present in recombinant virus infected S/9 
but not in wild type virus-infected cells. Moreover, each of the pphK2 or 
mhK2 mRNA positive lanes represents RNA isolated from S/9 cells infected 
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with recombinant viruses derived from a single viral plaque. Thus, the results 
suggest that high frequency (100%) of recombinant baculovirus containing 
either pphK2 or mhK2 was obtained from the above cotransfection. 
Furthermore, the sequences of pphK2 or mhK2 expressed in viral infected S/9 
5 cells were confirmed by a combination of RT-PCR, cloning and DNA 
sequencing. 

To determine Aether the pphK2 protein is expressed in the 
insect cell S/9, time course studies using 35 S-labeling of de novo synthesis of 
protein was performed and detected by SDS denaturing polyacrylamide gel 

10 electrophoresis (PAGE). As seen in the autoradiograph (Fig. 1), a unique 
protein (about 28 KDa) was found in pphK2-recombinant virus-infected S/9 
cells at 35-74 hour post-infection. This band was missing in uninfected cells 
or cells infected with wild type virus. The viral polyhedron protein (about 32 
KDa) was found (Fig. 1) as expected in S/9 cells infected with wild type 

15 virus, whereas it was not expressed by recombinant virus (Fig. 1). The 
protein was detected in cytosol when subcellular fractions (cytosol vs. 
insoluble fraction) was prepared by lysing cells with H 2 0 and freeze-thaw, 
whereas this 28 KD protein was detected in insoluble fraction when prepared 
by a detergent buffer and freeze-thaw (data not shown). 

20 The mhK2 protein was also expressed in the insect cell S^9, 35 S- 

labeling of de novo synthesized protein was performed. As seen in the 
autoradiograph (Fig. 2), a unique protein (about 28 KDa) was found in the 
insoluble fraction of mhK2-recombinant virus-infected S^9 cells at 48 hours 
post-infection. This band was missing in uninfected cells or cells infected 

25 with wild type virus. The viral polyhedron protein (about 32 KDa) was found 
in wild type virus-infected cells, whereas it was not expressed in cells infected 
with recombinant vims (Fig. 1). When the cytosol fraction was examined, no 
28 KDa band was observed. 



30 



B. E. Coli system 

Plasmid pBPPHK2 was transformed into E. coli BL21 (DE3) 
pLysS (Novagen, Inc., Madison, WI). This strain contains a chromosomal 
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copy of T7 RNA polymerase under the control of inducible LacUV5 
promoter. Upon addition of IPTG (isopropyl-P-D-thiogalactopyranoside) the 
expression of the T7 RNA polymerase is induced which in turn activates the 
T7 promoter resulting in overproduction of the gene product under control of 
5 this promoter. To determine whether the product of the ppHK2 gene would 
be expressed from pBPPHK2, single colonies of BL21 E. Coli transformed 
with pBPPHK2 were grown to O.D.^ = 0.2 in 10 ml LB media plus 
ampicillin (lOOug/ml) and induced with 0.4 mM IPTG (Sigma, Inc.). Cells 
were harvested 2 hours after induction by centrifugation and resuspended in 
10 1.5 ml SDS/PAGE sample buflfer (U.K. Laemmli, £to& 22Z, 680 1970) 
before SDS/PAGE analysis. The cell pellet from the IPTG-induced culture 
was resuspended in 0.05M Tris, pH 8.0 (at 9ml/gm cell pellet) and stirred at 
room temperature (25°C, r.t.) for 1 hour. Lysozyme (4 mg/ml) was added to 
this suspension (at 1 ml/gm cell pellet) and the suspension was stirred at r.t. 
15 for 30 min followed by incubation on ice for 30 min. The suspension was 
sonicated for 2 min at 150 watts and centrifuged at 3000xg to isolate the 
inclusion bodies. Inclusion bodies were resuspended in running buffer (25 
mM Tris, 192 mM glycine, 0.1% SDS) and after centrifugation both the pellet 
and the supernatant were analyzed by SDS/PAGE 
20 About 90% of the pphK2 was found to be in the supernatant 

fraction which indicated mat pphK2 is soluble in 0.1% SDS. To prepare 
samples for amino acid sequence analysis, 20ul of inclusion body lysate was 
subjected to SDS/PAGE on a 4-20% gradient gel (BIO-RAD, Inc., Melville, 
N.Y.). The protein was blotted from the gel onto 0.2u PVDF paper (BIO 
25 RAD) and stained with Coomassie blue. The protein band of interest was cut 
out from the blot and subjected to amino acid sequencing using a protein 
sequencer model 477A (Applied Biosystem, Inc., Foster City, CA). 

The induced cells overproduced large amounts of a polypeptide 
with apparent molecular mass of about 28kd (Figure 3). Densitometric 
30 analysis indicated that this protein comprised approximately 40% of total 
cellular protein. The size of this protein as determined by an SDS-PAGE gel 
was comparable to that predicted from coding sequence for pphK2. To 
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confirm that this protein is pphK2, the sequence of the first 10 amino acids 
(MWDLVLSIAL) (SEQ ID NO: 13) from the N-terminus was determined. 
This sequence agrees perfectly with that deduced from the DNA sequence of 
pphK2 cDNA As noted, it has different identity from the first 10 amino 
5 acids of both pphKl (MWFLVLCLAL) (SEQ ID NO: 14) and pphK3 
(MWVPVVFLTL) (SEQ ID NO: 15). It also shows that this protein is not 
modified or processed at the N-terminus either during or after expression in E. 
coli. These results demonstrate that we were able to accurately express 
pphK2 in E. coli from pBPPHK2. 

10 

C Mammalian System 

1* Isolation and Purification of protein 

Plasmid pGThK2 was transformed into hamster cell line AVI 2- 
664 (ATCC-CRL-9595). To determine whether the product of the ppHK2 

15 gene would be expressed from pGThK2, AV12-pGThK2 #2 was grown in 
D10F + 200nM MIX At about 60% confluency the cells were washed with 
Hank's balanced salt solution and resuspended in serum-free HH4 medium. 
The spent medium was collected after 7 days (serum-free spent medium) and 
stored at -20°C. Figure 8 depicts a SDS-PAGE confirming expression of 

20 recombinant pphK2 in a mammalian cell line. AV12-pGIhK2 (Lane 4-6) and 
AV12-pGT-d (Lane 3) clonal cell lines were grown in D10F media. About 
300(il of spent medium from the above clones were concentrated and 
subjected to SDS/PAGE along with See Blue MW marker (lane 1) and pphK2 
lysate from E. coli cells (lane 2). The gel was blotted onto nitrocellulose 

25 paper and immunoblotted using a 1/1000 dilution of anti-pphK2 rabbit 

antiserum. HRP-goat anti-rabbit was used as the secondary probe and the blot 
was developed by DAB plus H^. Lane 3 (AV12-pGT-d) is AV12 
transfected with vector without insert. 

To purify the protein, the serum-free spent medium was 

30 concentrated from 5-10 fold by ultrafiltration with a 10 kDa molecular weight 
cutoff membrane then dialyzed overnight at 4°C versus 50 mM sodium 
bicarbonate, pH 8. Samples were filtered with 0.2 \i filters and then pumped 
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directly onto a TSK DEAE-5PW HPLC column (21 mm X 150 mm) at a flow 
rate of 5 mUmin. Buffo- A contained 50 mM sodium bicarbonate, pH 7.9 
Buffer B contained 50 mM sodium bicarbonate plus 0.5 M sodium chloride, 
pH 7.6. The elution profile shown in Figure 9 was developed with a gradient 
5 from 0-50% Buffer B over 35 min; 50-100% B from 35-40 min and isocratic 
elution at 100% B for 5 min before re-equilibration in Buffer A. The flow 
rate was 5mL/min throughout 

DEAE fractions were assayed for the presence of hK2 by 
EUSA using rabbit anit-pphK2 as primary antibodies. The EUSA assayed 

10 showed a peak of hK2 activity which eluted at approximately 0-2M NaCl 
(shown as the triangle line in Figure 9), which correlated well with the 
appearance of a 34 kDa band of protein seen by SDS-PAGE in the same 
fractions (data not shown). 

Fractions with hK2 activity were pooled and concentrated by 

15 ultrafiltration with 10 kDa membranes to approximately 5-8 mL where upon 
solid ammonium sulfate was added to make a final concentration of 1 M. 
This sample was then injected onto a PolyLC. polypropyl aspartamide column, 
1000 A pore size, 4.6 mm X 200 mm, to resolve protein by hydrophobic 
interaction chromatography (HIC, see Figure 10). Buffer A was 20 mM Na 

20 phosphate, 1 .2 M Na sulfate pH 6.3 and Buffer B was 50 mM Na phosphate, 
5% 2-propanol, pH 7.4. The elution gradient was 0-20% B over 5 min; 20- 
55% B from 5-20 min, isocratic at 55% B from 20-23 min, 55-100% B from 
23-25 min; isocratic at 100% B for 2 min before re-equilibration Buffer A 
The flow rate was 0.7 mL/min. Greater than 90% of the A^ was not 

25 retained on HIC column. The main peak retained on HIC, which eluted at 22 
min, also showed the highest peak of activity by EUSA assay (triangle line, 
Figure 10). 

HIC fractions which tested positive for hK2 on EUSA were 
pooled, ultrafilter concentrated as above to a volume less than 1 mL then 
30 injected cm a 10/30 Pharmacia S12 size exclusion column equilibrated in 100 
mM ammonium acetate. The flow rate was 0.7 mL/min. When the 22 min 
peak from HIC was resolved by size exclusion chromatography, typically 
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about 80-90% of the protein A 280 eluted at 19.4 min, a retention time 
consistent with a protein of approximately 34 kDa (Figure 1 1). The only 
other protein peak on SEC, eluting at 16.7 min, corresponded to an about 
70KDa protein seen also in previous purification steps. 

5 To examine the efficiency of our purification scheme, 1.5 \ig of 

purified phK2 was subjected to SDS/PAGE in the presence or absence of [3- 
mercaptoethanol (BME), and the gel was stained with silver. Results showed 
that the phK2 in our sample was about 95% pure (Fig. 12). It also showed 
that pro-hK2 migrated at about 30 KD in the absence of BME, and it 

10 migrated at about 34 kDa in the presence of BME. This pattern is similar to 
that observed for the PSA purified from seminal fluid (Fig. 12). 

Recombinant phK2 is recognized by rabbit anti-pphK2, rabbit 
anti-PSA and a murine monoclonal antibody directed against a polypeptide 
covering amino acids 41-56 of hK2, when analyzed on WESTERN blots or 

15 when dried down on microtiter plates. However, phK2 was not detectable by 
these antibodies in sandwich assays. These results further demonstrate that 
the phK2 and PSA are conformationally different and the antibodies currently 
available to PSA or hK2 can not detect phK2 in its native form. Furthermore, 
phK2 was not detectable by the Tandem R or free-PSA assays (immunological 

20 assays for detection of PSA in serum). 

A sample of the hybridoma (HK1A 523.5) secreting the murine 
monoclonal antibody has been deposited in the American Type Culture 
Collection, Rockville, MD, and assigned ATCC HB-11876. 

25 2. Amino Acid Analysis and Protein Sequencing of phK2 

The peak collected off size exclusion chromatography (SEC) in 
ammonium acetate was lyophilized to remove the buffer then reconstituted in 
water. An aliquot (2.5jig)of this sample was loaded on a Porton membrane 
(Beckman instruments) and subjected to automated N-terminal sequence 
30 analysis on an Applied Biosystems model 477A protein sequencer which 
yielded the following sequence: 
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Val-Pn>I^-Ile^^ 18). An 

aliquot of the same sample in water was also hydrolyzed in gaseous 6 N HCI 
under vacuum for 20 h at 112 °C then reconstituted in 0.1N HQ and analyzed 
on an Hewlett Packard Aminoquant amino acid analyzer utilizing pre-column 
5 derealization of amino acids with OPA for primary and FMOC for secondary 
amines. 

No competing sequence was evident from the profile of amino 
acids released sequentially by the Edman degradation procedure. By analogy 
to PSA this protein is pro hK2, since the known sequence of mature PSA has 
10 been shown to begin with Ileu-Val-Gly-etc and pro PSA has been postulated 
to have an extra 7 amino acids at the N-terminus. Amino acid analysis of this 
protein yielded an amino acid composition consistent with the recombinant 
sequence of prohK2. These results demonstrate that pphK2 was accurately 
expressed in the mammalian cell line AV12-664 from pGThK2. 

15 

Example 5. 

Production of antibodies to recombinant pphK2 

A. E. Coli System 

To prepare pphK2 for rabbit immunization, the inclusion bodies 

20 obtained from bacterial cultures of BL21 (pBpphK2) after IPTG induction as 
in Example 4B were resuspended in 100^1 SDS/PAGE sanple buffer/ml 
bacterial culture and electrophoresed on preparative SDS/PAGE The pphK2 
band was excised and electroeluted from the gel into running buffer (25mM 
Tris, 192 mM glycine, 0.1% SDS) and used as the immunogen. Two rabbits 

25 were each immunized with lOOjig of the immunogen in complete Freunds 
adjuvant and were boosted twice in three week intervals with 100jng of the 
immunogen in incomplete Freund's adjuvant and PBS, respectively. Rabbit 
anti-pphK2 sera was obtained one week following the second boost. The 
presence of anti-pphK2 in the rabbit antiserum was shown by ELISA (data not 

30 shown). Once confirmed by this method, the highest titer antiserum was 
tested on Western blots using lysates from IPTG induced or non-induced 
cultures of BI.21 (pBpphK2). It was evident that the antiserum contained 
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antibodies highly specific for the pphK2 protein since a protein band at about 
28kd corresponding to pphK2 was recognized only in the induced lysate. The 
antiserum also recognized the purified pphK2 further showing the specificity 
of the antibodies to pphK2. Hie above data demonstrate that the antibodies 
5 recognize the prepro-form of hK2. 

To delineate if the antiserum recognizes the mature form of 
hK2 (mhK2), mhK2 was expressed in E. coli as a glutathione S-transferase 
fusion protein (GST-mhK2, 58kd), and the cell lysate was immunoblotted 
using anit-pphK2 rabbit antiserum. It was evident that anti-pphK2 antiserum 

10 recognized the GST-mkK2, demonstrating that antibodies were at least in part 
against the mature region of pphK2. To examine the pattern recognized in 
seminal fluid by anti-pphK2 antibodies, seminal fluid was prepared from 
pooled semen as described by Sensabaugh and Blake, J. Urology. 149. 1523 
(1990), and immunoblotted with anti-pphK2 rabbit antiserum. The antiserum 

15 recognized a major band at about 34kd plus several minor bands at lower 
MW. The pre-immune serum did not recognize any bands in any of the 
above experiments, showing that the antibodies were generated by 
immunization. 

To determine whether there are any pphK2-specific antibodies 
20 in rabbit anit-pphK2 antiserum, the antibodies cross-reacting to PSA were 
absorbed out of the antiserum by a PSA affinity resin. Specifically, 1ml of 
the sera was diluted with lmL 100 mM HEPES, pH. 7.5 and incubated with 
native PSA-bound AfBgel-10 for 3.5 hours at 4°C. The mixture was used 
poured into a column, the flow-through was collected and the column was 
25 washed with 30 ml HEPES buffer. Antibodies bound to the column (eluate) 
were eluted by acetic acid (IN, pH 4.0) and neutralized to pH. 6.6 with 
NH4OH. Native PSA was isolated from seminal fluid as described by 
Sensabaugh and Blake, cited above. ppPSA was purified from E. coli 
transformed with plasmid pPHS579 (containing ppPSA under control of T7 
30 promoter) using a procedure analogous to pphK2 purification. 

The flow-through and the column eluate were tested for Abs 
recognizing pphK2, ppPSA and native PSA (PSA isolated from seminal fluid) 
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using Western blot analysis. It was evident that antibodies in the untreated 
rabbit anti-pphK2 antiserum recognized all three proteins indicating that 
PphK2, ppPSA and seminal fluid-PSA share some similar epitopes. However, 
while the column eluate contained antibodies that recognized all three protein, 
5 the flow-through contained antibodies that recognized only pphK2. This 
indicates that anti-pphK2 antiserum contains pphK2-specific antibodies and 
these antibodies can be isolated by PSA affinity absorption. This system 
enabled us to generate anti-pphK2 antibodies which recognize both pphK2 and 
mhK2. Thus, utilizing immunogenic and pure recombinant hK2 protein, 

10 generate rabbit antiserum was generated which contains pphK2-specific 

antibodies, providing a valuable source for generating and screening for hK2- 
specific monoclonal antibodies. 

These examples describe the use of three heterologous 
expression systems (i.e. both prokaryotic and eukaiyotic) for the successful 

15 expression of the hK2 polypeptide. Thus, the method of the invention enables 
production of large quantities of substantially pure hK2 polypeptide. The 
polypeptide can be used both to study its biological functions and to produce 
immunodetection reagents such as labelled hK2 polypeptide, labelled 
fragments thereof and antibodies thereto. The immunoreagents can provide a 

20 method to purify native hK2 and to study the properties of the purified native 
hK2 polypeptide. 

The pphK2 overproduced in E. coli can be readily solubilized 
in0.1%SDS, thus solubility is not a problem. This is in contrast to the 
expression of human salivary kallifcrein protein, hKl, in E. coli, which was 

25 found in insoluble inclusion bodies (J. Wang, et al Biochem. 1. 276, 63 
(1991)). In contrast, the present invention yields almost pure protein which 
can be purified to homogeneity by preparative SDS-PAGE. This purified 
recombinant pphK2 can be used for the generation of monoclonal and 
polyclonal antibodies. 

30 As shown above, Baculogold viral DNA can be used to 

generate a recombinant baculovirus containing pphK2 or mhK2. Use of 
Baculogpld viral DNA provides high selection of positive recombinant 
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baculoviruses. Indeed, Northern blot analysis showed a high frequency of 
recombinant vims expressing pphK2 or mhK2 mRNA Moreover, SDS-PAGE 
analysis showed that both pphK2 and mhK2 recombinant viruses produced 
unique proteins with sizes similar to the calculated molecular weights for 

5 pphK2 or mhK2. Although the levels of the recombinant hK2 expressed in 
insect system may not be as high as in E. coli, the hK2 protein produced in 
baculovirus- insect system may contain the secreted form which would be 
more like the natural form of the protein. 

Plasmids pphK2/pVL1393 in E. col. H13101 has been 

10 deposited in the American Type Culture Collection, Rockville, MD, USA on 
May 2, 1994 under the provisions of the Budapest Treaty and have been 
assigned accession number ATCC 69614. 

The invention has been described with reference to various 
specific and preferred embodiments and techniques. However, it should be 

15 understood that many variations and modifications may be made while 
remaining within the spirit and scope of the inventioa 



WO 95/30758 



PCT/US95/06157 



30 

SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANT: Mayo Foundation for Medical Education 
and Research 
Hybritech Incorporated 
Tindall, Donald J. 
Young, Charles Y.F. 
Saedi, Mohammed S. 

(ii) TITLE OF INVENTION: Recombinant HK2 Polypeptide 
(iii) NUMBER OF SEQUENCES: 18 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Schwegman, Lundberg & Wbessner, P. A. 

(B) STREET: 3500 IDS Center 

(C) CITY: Minneapolis 

(D) STATE: MN 

(E) COUNTRY: USA 

(F) ZIP: 55402 

(v) COMPUTER READABLE FORM: 

(A) MEDI UM T YPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE : Patent In Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Raasch, Kevin W. 

(B) REGISTRATION NUMBER: 35,561 

(C) REFERENCE/DOCKET NUMBER: 150.148WO1 

(ix) TELEO0MMUNICATICN INFORMATICN: 

(A) TELEPHONE: 612-339-0331 

(B) TELEFAX: 612-339-3061 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 amino acids 

(B) TYPE: amino acid 

(C) STRANDECNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

He Val. Gly Gly Trp Glu Cys Glu Lys His Ser Gin Pro Trp Gin Val 
15 10 15 

Leu Val Ala Ser Arg Gly Arg Ala Val Cys Gly Gly Val Leu Val His 
20 25 30 

Pro Gin Trp Val Leu Thr Ala Ala His Cys He Arg Asn Lys Ser Val 
35 40 45 

He Leu Leu Gly Arg His Ser Leu Phe His Pro Glu Asp Thr Gly Gin 
50 55 60 

Val Phe Gin Val Ser Thr Ser Phe Pro His Pro Leu Tyr Asp Met Ser 
65 70 75 80 

Leu Leu Lys Asn Arg Phe Leu Arg Pro Gly Asp Asp Ser Ser His Asp 
85 90 95 

Leu Met Leu Leu Arg Leu Ser Glu Pro Ala Glu Leu Thr Asp Ala Val 
100 105 110 

Lys Val Met Asp Leu Pro Thr Gin Glu Pro Ala Leu Gly Thr Thr Cys 
115 120 125 

Tyr Ala Ser Gly Trp Gly Ser He Glu Pro Glu Glu Phe Leu Thr Pro 
130 135 140 

Lys Lys. Leu Gin Cys Val Gin Leu His Val He Ser Asn Asp Val Cys 
145 150 155 160 

Ala Gin Val His Pro Gin Lys Val Thr Lys Phe Met Leu Cys Ala Gly 
165 170 175 

Arg Trp Thr Gly Gly Lys Ser Thr Cys Ser Gly Asp Ser Gly Gly Pro 
180 185 190 

Leu Val Cys Asn Gly Val Leu Gin Gly He Thr Ser Trp Gly Ser Glu 
195 200 205 

Pro Cys Ala Leu Pro Glu Arg Pro Ser Leu Tyr Thr Lys Val Val His 
210 215 220 

Tyr Arg Lys Trp He Lys Asp Thr He Val Ala Asn Pro 
225 230 235 



WO 95/30758 



32 

(2) INFORMATION FDR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 
AOGOGGATCC AGCAICTGQG AOCTGGTTCT CT 

(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE OlARACTERISTICS : 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 
ACAGCTGCAG TTTACTAGAG GTAGGGGTQG GAC 



(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
ACOGGAAITC ATCA3TGTGG GAGGCTQGGA GICT 
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(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 832 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE; 

(A) NAME/KEY: CDS 

(B) LOCATION: 10. .792 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GGATCCAGC ATG TGG GAC CIG GTT CTC TCC ATC GCC TTG TCT GIG GGG 48 
Met Trp Asp Leu Val Leu Ser lie Ala Leu Ser Val Gly 
1 5 10 

TGC ACT GGT GCC GIG CCC CTC ATC CAG TCT CGG ATT GIG GGA GGC TGG 96 
Cys Thr Gly Ala Val Pro Leu He Gin Ser Arg He Val Gly Gly Trp 
15 20 25 

GAG TGT GAG AAG CAT TCC CAA CCC TGG CAG GIG GCT GIG TAC ACT CAT 144 
Glu Cys Glu Lys His Ser Gin Pro Trp Gin Val Ala Val Tyr Ser His 
30 35 40 45 

GGA TGG GCA CAC TGT GGG GGT GTC CIG GIG CAC CCC CAG TGG GIG CTC 192 
Gly Trp Ala His Cys Gly Gly Val Leu Val His Pro Gin Trp Val Leu 
50 55 60 

ACA GCT GCC CAT TGC CTA AAG AAG AAT AGC CAG GTC TGG CIG GGT CGG 240 
Thr Ala Ma His Cys Leu Lys Lys Asn Ser Gin Val Trp Leu Gly Arg 
65 70 75 

CAC AAC CIG TIT GAG OCT GAA GAC ACA GGC CAG AGG GTC CCT GTC AGC 288 
His Asn Leu Phe Glu Pro Glu Asp Thr Gly Gin Arg Val Pro Val Ser 
80 85 90 

CAC AGC TIC CCA CAC COG CTC TAC AAT ATG AGC CTT CIG AAG CAT CAA 336 
His Ser Phe Pro His Pro Leu Tyr Asn Met Ser Leu Leu Lys His Gin 
95 100 105 

AGC CTT AGA CCA GAT GAA GAC TCC AGC CAT GAC CTC ATG CIG CTC OGC 384 
Ser Leu Arg Pro Asp Glu Asp Ser Ser His Asp Leu Met Leu Leu Arg 
110 115 120 125 

CIG TCA GAG OCT GCC AAG ATC ACA GAT GIT GIG AAG GTC CIG GGC CIG 432 
Leu Ser Glu Pro Ala Lys He Thr Asp Val Val Lys Val Leu Gly Leu 
130 135 140 

CCC ACC CAG GAG CCA GCA CIG GGG AOC ACC TGC TAC GCC TCA GGC TGG 480 
Pro Thr Gin Glu Pro Ala Leu Gly Thr Thr Cys Tyr Ala Ser Gly Trp 
145 150 155 

GGC AGC ATC GAA CCA GAG GAG TIC TIG OGC CCC AGG ACT CTT CAG TGT 528 
Gly Ser He Glu Pro Glu Glu Phe Leu Arg Pro Arg Ser Leu Gin Cys 
160 165 170 
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CTG AGC CTC CAT CTC CTG TCC AAT GAC ATG TCT GCT AGA GCT TAG TCT 576 
Val Ser Leu His Leu Leu Ser Asn Asp Met Cys Ala Ara Ala Tyr Ser 
175 180 185 

GAG AAG GIG ACA GAG TIC AIG TIG TCT GCT GGG CTC IGG ACA GGT GCT 624 
Glu Lys Val Thr Glu Phe Met Leu Cys Ala Gly Leu Trp Thr Glv Glv 
190 195 200 * 205 

AAA GAC ACT TCT GGG GGT GAT TCT GGG GGT CCA CTT GTC TCT AAT GCT 672 
Lys Asp Thr Cys Gly Gly Asp Ser Gly Gly Pro Leu Val Cys Asn Gly 
210 215 220 

GIG CIT CAA GCT ATC ACA TCA TOG GGC OCT GAG CCA TCT GCC CIG CCT 720 
Val Leu Gin Gly He Thr Ser Trp Gly Pro Glu Pro Cys Ala Leu Pro 
225 230 235 

GAA AAG OCT GCT GIG TAC ACC AAG GIG GIG CAT TAG OGG AAG TOG ATC 768 
Glu Lys Pro Ala Val Tyr Thr Lys Val Val His Tyr Ara Lys Trp He 
240 245 250 

AAG TAC ACC ATC GCA GCC AAC COC TCAGIGCCCC TGTCCCACCC CEAOCTCTAG 822 
Lys Tyr Thr He Ala Ala Asn Pro 
255 260 



TAAACTGCAG 



832 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 261 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Trp Asp Leu Val Leu Ser He Ala Leu Ser Val Gly Cys Thr Gly 
1 5 10 15 

Ala Val Pro Leu He Gin Ser Arg He Val Gly Gly Trp Glu Cys Glu 
20 25 30 

Lys His Ser Gin Pro Trp Gin Val Ala Val Tyr Ser His Gly Trp Ala 
35 40 45 

His Cys Gly Gly Val Leu Val His Pro Gin Trp Val Leu Thr Ala Ala 
50 55 60 

His Cys Leu Lys Lys Asn Ser Gin Val Trp Leu Gly Arg His Asn Leu 
65 70 75 80 

Phe Glu Pro Glu Asp Thr Gly Gin Arg Val Pro Val Ser His Ser Phe 
85 90 95 

Pro His Pro Leu Tyr Asn Met Ser Leu Leu Lys His Gin Ser Leu Ara 
100 105 no 
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Pro Asp Glu Asp Ser Ser His Asp Leu Met Leu Leu Arg Leu Ser Glu 
115 120 125 

Pro Ala Lys He Thr Asp Val Val Lys Val Leu Gly Leu Pro Thr Gin 
130 135 140 

Glu Pro Ala Leu Gly Thr Thr Cys Tyr Ala Ser Gly Trp Gly Ser He 
145 150 155 160 

Glu Pro Glu Glu Phe Leu Arg Pro Arg Ser Leu Gin Cys Val Ser Leu 
165 170 175 

His Leu Leu Ser Asn Asp Met Cys Ala Arg Ala Tyr Ser Glu Lys Val 
180 185 190 

Thr Glu Phe Met Leu Cys Ala Gly Leu Trp Thr Gly Gly Lys Asp Thr 
195 200 205 

Cys Gly Gly Asp Ser Gly Gly Pro Leu Val Cys Asn Gly Val Leu Gin 
210 215 220 

Gly He Thr Ser Trp Gly Pro Glu Pro Cys Ala Leu Pro Glu Lys Pro 
225 230 235 240 

Ala Val Tyr Thr Lys Val Val His Tyr Arg Lys Trp He Lys Tyr Thr 
245 250 255 

He Ala Ala Asn Pro 
260 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 760 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 7.. 720 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GAATTC ATG ATT GIG GGA GGC TGG GAG TCT GAG AAG CAT TCC CAA CCC 48 
Met He Val Gly Gly Trp Glu Cys Glu Lys His Ser Gin Pro 
15 10 

TGG CAG GIG GCT GIG TAG AGT CAT GGA TOG GCA CAC TGT GGG GGT GTC 96 
Trp Gin Val Ala Val Tyr Ser His Gly Trp Ala His Cys Gly Gly Val 
15 20 25 30 

CTG GIG CAC CCC CAG TGG GIG CTC ACA GCT GCC CAT TGC CTA AAG AAG 144 
Leu Val His Pro Gin Trp Val Leu Thr Ma Ala His Cys Leu Lys Lys 
35 40 45 
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AAT AGC CAG CTC TGG CTG GGT 03G CAC AAC CTG TTT GAG CCT GAA GAC 192 
Asn Ser Gin Val Trp Leu Gly Arg His Asn Leu Phe Glu Pro Glu Asp 
50 55 60 

^^^^^ OT ^^^ c AGCTTCOC^CACCCGCTCTAC 240 
Thr Gly Gin Arg Val Pro Val Ser His Ser Phe Pro His Pro Leu Tyr 
65 70 75 

AAT ATG AGC CTT CTG AAG CAT CAA AGC CTT AGA CCA GAT GAA GAC TCC 288 
Asn Met Ser Leu Leu Lys His Gin Ser Leu Arg Pro Asp Glu Asp Ser 
80 85 go 

AGC CAT GAC CTC ATG CTG CTC CGC CTG TCA GAG CCT GCC AAG ATC ACA 336 
Ser His Asp Leu Met Leu Leu Arg Leu Ser Glu Pro Ala Lys He Thr 
95 100 105 no 

®^ ^p!f GTC CTG GGC CTG CCC ACC CAG GAG CCA GCA CTG GGG 384 
Asp Val Val Lys Val Leu Gly Leu Pro Thr Gin Glu Pro Ala Leu Glv 
115 120 125 

^^^ mC ^^^^^^ATCGAACCAGAGGAGTTC 432 
Thr Thr Cys Tyr Ala Ser Gly Trp Gly Ser He Glu Pro Glu Glu Phe 
130 135 140 

TTG CGC CCC AGG AGT CTT CAG TGT GTG AGC CTC CAT CTC CTG TCC AAT 480 
Leu Arg Pro Arg Ser Leu Gin Cys Val Ser Leu His Leu Leu Ser Asn 
145 150 155 

GAC ATG TGT GCT AGA GCT TAC TCT GAG AAG GTG ACA GAG TTC ATG TTG 528 
Asp Met Cys Ala Arg Ala Tyr Ser Glu Lys Val Thr Glu Phe Met Leu 
160 165 170 

TGT GCT GGG CTC TGG ACA GGT GGT AAA GAC ACT TCT GGG GGT GAT TCT 576 
Cys Ala Gly Leu Trp Thr Gly Gly Lys Asp Thr Cys Gly Gly Asp Ser 
175 180 185 190 

GGG GGT CCA CTT GTC TGT AAT GGT GTG CTT CAA GGT ATC ACA TCA TGG 624 
Gly Gly Pro Leu Val Cys Asn Gly Val Leu Gin Gly He Thr Ser Ttd 
195 200 205 

^^^^^^^ OT ^^^GCTGTGTACACCAAG 672 
Gly Pro Glu Pro Cys Ala Leu Pro Glu Lys Pro Ala Val Tyr Thr Lvs 
210 215 220 

PF? £p? TGG ATC AAG TAC ACC ATC GCA GCC AAC CCC 720 

Val Val His Tyr Arg Lys Trp He Lys Tyr Thr He Ala Ala Asn Pro 
225 230 235 

TGAGTGCCCC TGTCCCACCC CTACCTCTAG TAAACTGCAG 760 
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(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 238 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met He Val Gly Gly Trp Glu Cys Glu Lys His Ser Gin Pro Trp Gin 
15 10 15 

Val Ala Val Tyr Ser His Gly Trp Ala His Cys Gly Gly Val Leu Val 
20 25 30 

His Pro Gin Trp Val Leu Thr Ala Ala His Cys Leu Lys Lys Asn Ser 
35 40 45 

Gin Val Trp Leu Gly Arg His Asn Leu Phe Glu Pro Glu Asp Thr Gly 
50 55 60 

Gin Arg Val Pro Val Ser His Ser Phe Pro His Pro Leu Tyr Asn Met 
65 70 75 80 

Ser Leu Leu Lys His Gin Ser Leu Arg Pro Asp Glu Asp Ser Ser His 
85 90 95 

Asp Leu Met Leu Leu Arg Leu Ser Glu Pro Ala Lys He Thr Asp Val 
100 105 110 

Val Lys Val Leu Gly Leu Pro Thr Gin Glu Pro Ala Leu Gly Thr Thr 
115 120 125 

Cys Tyr Ala Ser Gly Trp Gly Ser He Glu Pro Glu Glu Phe Leu Arg 
130 135 140 

Pro Arg Ser Leu Gin Cys Val Ser Leu His Leu Leu Ser Asn Asp Met 
145 150 155 160 

Cys Ala Arg Ala Tyr Ser Glu Lys Val Thr Glu Phe Met Leu Cys Ala 
165 170 175 

Gly Leu Trp Thr Gly Gly Lys Asp Thr Cys Gly Gly Asp Ser Gly Gly 
180 185 190 

Pro Leu Val Cys Asn Gly Val Leu Gin Gly He Thr Ser Trp Gly Pro 
195 200 205 

Glu Pro Cys Ala Leu Pro Glu Lys Pro Ala Val Tyr Thr Lys Val Val 
210 215 220 

His Tyr Arg Lys Trp He Lys Tyr Thr He Ala Ala Asn Pro 
225 230 235 
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(2) INFORMATION FOR SBQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 766 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cENA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .732 



(xi) SECXJENCE DESCRIPTION: SEQ ID NO: 9: 



GIG CCC CTC ATC CAG TCT OGG ATT GIG GGA GGC TOG GAG TCT GAG AAG 48 
Val Pro Leu He Gin Ser Arg He Val Gly Gly Trp Glu Cys Glu Lys 
15 10 15 

CAT TCC CAA CCC TCG CAG GIG GCT GIG TAC ACT CAT GGA TGG GCA CAC 96 
His Ser Gin Pro Trp Gin Val Ala Val Tyr Ser His Gly Trp Ala His 
20 25 J 30 

JP 1 * 2?* 2?^ ^9 TOG GIG CTC ACA GCT GCC CAT 144 

Cys Gly Gly Val Leu Val His Pro Gin Trp Val Leu Thr Ala Ala His 
35 40 45 

TCC CIA AAG AAG AAT AGC CAG CTC TGG CTG GGT OGG CAC AAC CTG ITT 192 
Cys Leu Lys Lys Asn Ser Gin Val Trp Leu Gly Arg His Asn Leu Hie 
50 55 60 

GAG OCT GAA GAC ACA GGC CAG AQG GTC OCT CTC AGC CAC AGC TTC CCA 240 
Glu Pro Glu Asp Thr Gly Gin Arg Val Pro Val Ser His Ser Phe Pro 
65 70 75 80 

CAC COG CTC TAC AAT ATG AGC CTT CIG AAG CAT CAA AGC CTT AGA CCA 288 
His Pro Leu Tyr Asn Met Ser Leu Leu Lys His Gin Ser Leu Ara Pro 
85 90 95 

GAT GAA GAC TCC AGC CAT GAC CTC ATC CIG CTC OGC CIG TCA GAG CCT 336 
Asp Glu Asp Ser Ser His Asp Leu Met Leu Leu Arg Leu Ser Glu Pro 
100 105 no 

GCC AAG ATC ACA GAT GTT GIG AAG CTC CIG GGC CIG CCC AOC CAG GAG 384 
Ala Lys He Thr Asp Val Val Lys Val Leu Gly Leu Pro Thr Gin Glu 
115 120 125 

^^CTGG^^^CTCCTACGCCTCAGGCTCGGGCAGCATCG^ 432 
Pro Ala Leu Gly Thr Thr Cys Tyr Ala Ser Gly Trp Gly Ser He Glu 
130 135 140 

CCA GAG GAG TTC TIG OGC CCC AGG ACT CTT CAG TCT GTG AGC CTC CAT 480 
Pro Glu Glu Phe Leu Arg Pro Arg Ser Leu Gin Cys Val Ser Leu His 
145 150 155 160 

CTC CIG TCC AAT GAC ATC TCT GCT AGA GCT TAC TCT GAG AAG GIG ACA 528 
Leu Leu Ser Asn Asp Met Cys Ala Arg Ala Tyr Ser Glu Lys Val Thr 
165 170 175 
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GAG TTC ATC TIG TGT GCT GGG CTC TGG ACA GOT GOT AAA GAC ACT TGT 576 
Glu Phe Met Leu Cys Ala Gly Leu Trp Thr Gly Gly Lys Asp Thr Cys 
180 185 190 

GGG GGT GAT TCT GGG GCT CCA CIT GTC TGT AAT GGT GIG CTT CAA GGT 624 
Gly Gly Asp Ser Gly Gly Pro Leu Val Cys Asn Gly Val Leu Gin Gly 
195 200 205 

ATC ACA TCA TGG GGC CCT GAG CCA TCT GCC CIG CCT GAA AAG CCT GCT 672 
lie Thr Ser Trp Gly Pro Glu Pro Cys Ala Leu Pro Glu Lys Pro Ala 
210 215 220 

CTG TAC ACC AAG GIG GIG CAT TAC COG AAG TGG ATC AAG TAC ACC ATC 720 
Val Tyr Thr Lys Val Val His Tyr Arg Lys Trp lie Lys Tyr Thr lie 
225 230 235 240 

GCA GCC AAC CCC TGACTGCCCC TCTCCCACCC CTACCTCIAG TAAA 766 
Ala Ala Asn Pro 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 244 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Val Pro Leu lie Gin Ser Arg lie Val Gly Gly Trp Glu Cys Glu Lys 
15 10 15 

His Ser Gin Pro Trp Gin Val Ala Val Tyr Ser His Gly Trp Ala His 
20 25 30 

Cys Gly Gly Val Leu Val His Pro Gin Trp Val Leu Thr Ala Ala His 
35 40 45 

Cys Leu Lys Lys Asn Ser Gin Val Trp Leu Gly Arg His Asn Leu Phe 
50 55 60 

Glu Pro Glu Asp Thr Gly Gin Arg Val Pro Val Ser His Ser Phe Pro 
65 70 75 80 

His Pro Leu Tyr Asn Met Ser Leu Leu Lys His Gin Ser Leu Arg Pro 
85 90 95 

Asp Glu Asp Ser Ser His Asp Leu Met Leu Leu Arg Leu Ser Glu Pro 
100 105 110 

Ala Lys He Thr Asp Val Val Lys Val Leu Gly Leu Pro Thr Gin Glu 
115 120 125 

Pro Ala Leu Gly Thr Thr Cys Tyr Ala Ser Gly Trp Gly Ser He Glu 
130 135 140 

Pro Glu Glu Phe Leu Arg Pro Arg Ser Leu Gin Cys Val Ser Leu His 
145 150 155 160 
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Leu Leu Ser Asn Asp Met Cys Ala Arg Ala Tyr Ser Glu Lys Val Thr 
165 170 175 

Glu Phe Met Leu Cys Ala Gly Leu Trp Thr Gly Gly Lys Asp Thr Cys 
180 185 190 

Gly Gly Asp Ser Gly Gly Pro Leu Val Cys Asn Gly Val Leu Gin Gly 
195 200 205 

He Thr Ser Trp Gly Pro Glu Pro Cys Ala Leu Pro Glu Lys Pro Ala 
210 215 220 

Val Tyr Thr Lys Val Val His Tyr Arg Lys Trp He Lys Tyr Thr He 
225 230 235 240 

Ala Ala Asn Pro 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SECUENCE DESCRIPTION: SEQ ID NO: 11: 
TATACATATG TGGGACCP3G TTCTCTCC 28 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
ATATCGATCC TCAGQQGITG GCTG03ATGG T 
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(2) INFORMATION FOR SEQ ID NO; 13: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Trp Asp Leu Val Leu Ser lie Ala Leu 
15 10 



(2) INFORMATION FOR SEQ ID NO:14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 

Met Trp Phe Leu Val Leu Cys Leu Ala Leu 
15 10 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 15: 

Met Trp Val Pro Val Val Phe Leu Thr Leu 
15 10 



WO 95/30758 



PCT/US95/06157 



42 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 amino acids 

(B) TYPE: amino acid 

(C) STRANDEENESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

lie Val Gly Gly Trp Glu Cys Glu Lys His Ser Gin Pro Trp Gin Val 
1 5 10 15 

Ala Val Trp Ser His Gly Trp Ala His Cys Gly Gly Val Leu Val His 
20 25 30 

Pro Gin Trp Val Leu Thr Ala Ala His Cys Leu Lys Lys Asn Ser Gin 
35 40 45 

Val Trp Leu Gly Arg His Asn Leu Phe Glu Pro Glu Asp Thr Gly Gin 
50 55 60 

Arg Val Pro Val Ser His Ser Phe Pro His Pro Leu Tyr Asn Met Ser 
65 70 75 80 

Leu Leu Lys His Gin Ser Leu Arg Pro Asp Glu Asp Ser Ser His Asp 
85 90 95 

Leu Met Leu Leu Arg Leu Ser Glu Pro Ala Lys lie Thr Asp Val Val 
100 105 no 

Lys Val Leu Gly Leu Pro Thr Gin Glu Pro Ala Leu Gly Thr Thr Cys 
115 120 125 

Tyr Ala Ser Gly Trp Gly Ser He Glu Pro Glu Glu Phe Leu Ara Pro 
130 135 140 * 

Arg Ser Leu Gin Cys Val Ser Leu His Leu Leu Ser Asn Asp Met Cys 
145 150 155 * 160 

Ala Arg Ala Tyr Ser Glu Lys Val Thr Glu Phe Met Leu Cys Ala Glv 
165 170 175 

Leu Trp Thr Gly Gly Lys Asp Thr Cys Gly Gly Asp Ser Gly Glv Pro 
180 185 190 

Leu Val Cys Asn Gly Val Leu Gin Gly He Thr Ser Trp Gly Pro Glu 
195 200 205 

Pro Cys Ala Leu Pro Glu Lys Pro Ala Val Tyr Thr Lys Val Val His 
210 215 220 

Tyr Arg Lys Trp He Lys Asp Thr lie Ala Ala Asn Pro 
225 230 235 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
ATATGGATCC ATATGTCAGC ATGTGGGACC TGGITCTCTC CA 42 



(2) INFORMATION FOR SEQ ID NO:18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) . SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Val Pro Leu He Gin Ser Arg He Val Gly Gly Trp Glu 
15 10 
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WHAT IS CLAIMED IS: 

L Isolated, substantially homogeneous pre-pro hK2 polypeptide. 
5 2. Isolated, substantially homogenous pro hK2 polypeptide. 

3. Isolated, substantially homogenous mature hK2 polypeptide. 

4. An antibody that is capable of specifically binding the hK2 polypeptide 
10 of claims 1, 2 or 3 and which does not bind to hK3. 

5. The antibody of claim 4 which is a monoclonal antibody. 

6. A hybridoma cell line producing the antibody of claim 5. 

15 

7. An isolated nucleic acid molecule encoding the polypeptides of claim 
1,2 or 3. 

8. An isolated nucleic acid molecule selected from the group consisting 
20 of 

(a) cDNA comprising the nucleotide sequence of the coding region 
of the hK2 gene; 

(b) DNA capable of hybridizing under stringent conditions to a 
nucleotide sequence complementary to the nucleotide sequence 

25 of (a); and 

(c) a genetic variant of any of the DNA of (a) and (b) which 
encodes a polypeptide possessing an antigenic function of 
naturally occurring hK2 polypeptide. 

30 9. The nucleic acid molecule of claim 8 further comprising a promoter 
operably linked to the nucleic acid molecule. 
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10. A chimeric expression vector comprising the nucleic acid molecule of 
claim 7 operably linked to control sequences recognized by a host cell 
transformed with the vector. 

5 11. The vector of claim 10 wherein the host cell is E. coli . 

12. The vector of claim 10 wherein the host cell is a mammalian cell. 

13. A host cell transformed with the vector of claim 10. 

10 

14. The host cell of claim 13 which is E. coli . 

15. The host cell of claim 13 which is mammalian. 

15 16. A method of using a nucleic acid molecule encoding a hK2 

polypeptide comprising expressing the nucleic acid molecule of claim 
7 in a cultured host cell stably transformed with a chimeric vector 
comprising said nucleic acid molecule operably linked to control 
sequences recognized by the host cell transformed with the vector, and 

20 recovering hK2 polypeptide from the host cell. 

17. The method of claim 16 wherein the host cell is E. coli . 

18. The method of claim 16 wherein the host cell is mammalian. 

25 

19. The method of claim 16 wherein the nucleic acid molecule is DNA 

20. The method of claim 16 wherein the hK2 polypeptide is recovered 
from the host cell culture medium. 
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CCTACGTCCTACACCCTGGACC^^ 

l>KfetTrpAs£leuVal LeuSer I » eAI ateuSer Val GlyCysThr Gl yAl aVal Pr 

GO. **AGCTCAGAG:CTAACACC^^ 

oLeu! 1 eGI nS *' Ar 9 1 ' eVal Gl yGI yTrpGi uCysG! uLysHi sSer.GI nProTrpGI nValA 

13 1 CTt^^ACAGTCATGGATGGG^^^ 

41HaValTyrSerHis6JyTfpAlaHisCysGlyGlyValLeoValHisProGI nT rpVal leuThr 
63*AI aAl aHi sCysleulyslysAsnSer Gl nVal TrpLeuGI yA rgHI sAsnLeuPheGI uProGI 
84 ► uAspThr GJyGI nArgVa t ProVa I Ser Hi sSer PheProHi sProLeuTyrAsnkfetSer LeuL 
106> euLy sH, sG! nSer LeuArgProAspGI uAspSer Ser Hi sAspLeute , Leul^gTeuSeT 




128> Gl uProAi aLys 1 1 eThr AspVa I Va I LysVa I LeuGl yLeuProThr Gl nGI uProAlaL.ua 
149> yThr Thr CysTyrAI a Ser 61 y TrpGI ySer 1 1 eGI uProGI uGl uPheLeuArgP roArgSer L 
171> euGI nCysVa I Ser LeuHisLeuLeuSerAsnAspMe t CysAI a ArgAI aTyrSer Gl uLy sVa I 

193 ► Thr Gl uPh*** « LouCysAI aGI y LeuT rpThr Gl yG I y\ys AspThr CysGlyGI yAspSeT Gl 

^^^-^^.TTACCACA.Ca^GTTCCATAGTGTAGTACCCCGGGACTCC^^ 
214 ► yGI y ProLeuVa I CysAsnGI yVa I UuGI nGI y 1 1 eThr Ser TrpGI y ProGI uProCysAI at 

ZSe^euProGluLysProAlaValTyrThrLysValValHisTyrArgLysTrplleLysTyrThrlle 

781 C^X^CCCCTGACT^^ 

258' Al aAl aAsnPro ' 



Pstl 



Figure 5 
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1 GA^TTC^TGVTTGT^r^ 
CT7AAGTACT/V*^ 

1 ► Kte 1 1 leValGlyGlyTrpGI uCysGI uLysHi sSer Gl nPro 

ACCGTCC^CCACAC-.TGTCAGTACCTACCCGTGTGACACCCCCACAG 
lSHrpGI nVa IAI aVa I TyrSer Hi sGI yTrpAl aHisCysGI yGlyVal 

31> LeuVal Hi sP roGI nT rpVa I LeuThr Al aAl aHi sCysLeuLy sLys 

14 5 A^7ACCCAC</TC7CC<^ 
TTATCGGTNTCAGACC^^ 
47 ► AsnSer Gl nVal T rpLeuGI yA rgHi sAsnLeuPheGI uProGI uAsp 

193 ACAGGCCA-XV,3GG?CCCTC^^ 
TGTCCGGTCTCCCAGGGA-^ 
63>ThrGI yGlnArgVal ProVal Ser Hi sSer PheProHi sProLeuTyr 

241 AJVTATGAGCCjTCTGAAG^^^ 

TT ATACTCGGAAG AC 7TCG7A Gn'^iCC-G AA TC7GG'J CTAC ITCTG A GG 
19> AsnMet Ser LeuLeuLysHi sGI nSer LeuAr gPr oAspGI uAspSer 
2S9 A<x:c-iT^ccTcvrrc^ 

TCGGTACTGGAGTACGACGA^ 
95^ Ser Hi sAspLeuMet LeuLeuArgLeuSer Gl uProAl aLys I I eThr 

337 GATGTTG?GAAG£TCCTGGGCCT\?GO 

ctacjac^lttccvgga^ 

111* AspVal Va I LysVa I LeuGI yLeuProThr Gl nGI uProAl aLeuGI y 

385 ACG\CCTO:tACGCCTCA 

TGGTGGACGATGCGGAGTCCGACCCCGTCGTAGCT^ 
127>ThrThr CysTy r Al aSer Gl yT rpGI ySer I leGluProGI uGluPhe 
433 TTC^GCCCCA^.GT^ 

AACGGGGGGTCCTCAGA^^^ 
143> LeuArgProArgSer LeuGI nCysVa I Ser LeuHi sLeuLeuSerAsn 
481 GACATGTGTOCTAGAGC^ 

CTGTAC*CACGA?CTCG>J^ 
159 ► AspMet CysAI aArgAI aTyrSer Gl uLysVa I Thr Gl uPheMet Leu 

529 TGlV^ftSGGClCTGGAC^-G 
ACACCACCCGAGACCTGTC^ 
175> CysAI aGI yLeuT rpThr Gl yGI yLysAspThr CysGl yGI yAspSer 

577 GGGGGTCC^.CTTGTCTGTAATGGTGTGCTTC^ 

CCCCCAGGTGAAOXfttt^ 
191 ► Gl yGI y ProLeuVa I CysAsnGI yVa I LeuGI nGI y I I eThr Ser T rp 
625 GGCCCTGAC£CATCK*XC^ 

CCGGGAC7CGGTACACGGGACGGACTTTTCGGACGACACA 
207> Gl yProGI uProCysA! aLeuProGI uLysProAl aVa i Ty rThr Lys 

673 GToGTGCATTACCGGAAGT^GGATCA 

CACCAOiTAATvK^^ 
223 ► Val Val Hi sTy r ArgLysT rpl I eLysTyrThr 1 1 eAI aAl aAsnPro 
721 T^AGTGCCCCJtTTC^ 

ACTCACGGGGACAGGGTO^ 



Figure 6 
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1 2?7!7377 c ^ T ^^ rcc ^T^ 

!► Va I ProLeu 1 1 eGl nSerArg 1 1 eVal Gl yGlyT rpGI uCysGI uLysHi sSer GI nPro 




21> T rpG! nVa 1 Al aVa I Ty rSer Hi sGI yT rpAf aHi s CysGI yG I yVa ReuVa IHisPro 

121 c^x3Gcn«ncACAc<r^^ 

41> Gl nT rpVa I LeuThr Al aAl aHi s CysLeuLyslysAsnSer Gl nVa I T rpLeuGI yA rg 

61 ► Hi sAsnLeuPheGI uProGI uAspThr Gl yG I nArgVa I ProVa I Ser HisSer PhePro 
241 ^CGCTCTACAATATCftrcC^^ 

81> Hi sProLeuTyrAsnMelSer leuleulysHi sGI nSer LeuArgProAspGI uAspSer 




101 > Ser Hi sAspLeuMa t LeuLeu Ar B LeuSer Gl uP roAl aLys 1 1 eThr AspVa I Va I Lys 



361 




121^ Va I LeuGI yLeuP roThr Gl nGJ uProAl aLeuGI yThr Thr CysTy r Al aSer Gl yTrp 



421 




141>G. ySer neG.uProG.cG.uPheUuA^ProArgSeTre^^vVlS^s 
161* LeuLeuSer AsnAspMe t CysAI a ArgAI aTy rSer Gl uLys Va I Thr Gl uPheMai Leu 




1S1> CysAI aGI yLeuT rpThr Gl yGl yLysAspThr CysGI yGI yAspSer GI yGI y "p"oLeu 




201>Va,CysA S nG.yV a ,LeuG,nGly,,e-^ 

221> Gl uLy sProA. aVa ITyrThr LysVa I Va I HI S Tyr A^^pTte^^T^e 

721 GCAGCCA-^CCCCTGAC^CCCTGICCCACCCCTACCXTAGTAAA 

CCTCG<nTGGSGACTCACGGG:^^^ 
241>AlaAI aAsnPro 



Figure 7 
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